87 research outputs found

    Transposable element distribution, abundance and role in genome size variation in the genus Oryza

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The genus <it>Oryza </it>is composed of 10 distinct genome types, 6 diploid and 4 polyploid, and includes the world's most important food crop โ€“ rice (<it>Oryza sativa </it>[AA]). Genome size variation in the <it>Oryza </it>is more than 3-fold and ranges from 357 Mbp in <it>Oryza glaberrima </it>[AA] to 1283 Mbp in the polyploid <it>Oryza ridleyi </it>[HHJJ]. Because repetitive elements are known to play a significant role in genome size variation, we constructed random sheared small insert genomic libraries from 12 representative <it>Oryza </it>species and conducted a comprehensive study of the repetitive element composition, distribution and phylogeny in this genus. Particular attention was paid to the role played by the most important classes of transposable elements (Long Terminal Repeats Retrotransposons, Long interspersed Nuclear Elements, helitrons, DNA transposable elements) in shaping these genomes and in their contributing to genome size variation.</p> <p>Results</p> <p>We identified the elements primarily responsible for the most strikingly genome size variation in <it>Oryza</it>. We demonstrated how Long Terminal Repeat retrotransposons belonging to the same families have proliferated to very different extents in various species. We also showed that the pool of Long Terminal Repeat Retrotransposons is substantially conserved and ubiquitous throughout the <it>Oryza </it>and so its origin is ancient and its existence predates the speciation events that originated the genus. Finally we described the peculiar behavior of repeats in the species <it>Oryza coarctata </it>[HHKK] whose placement in the <it>Oryza </it>genus is controversial.</p> <p>Conclusion</p> <p>Long Terminal Repeat retrotransposons are the major component of the <it>Oryza </it>genomes analyzed and, along with polyploidization, are the most important contributors to the genome size variation across the <it>Oryza </it>genus. Two families of Ty3-<it>gypsy </it>elements (<it>RIRE2 </it>and <it>Atlantys</it>) account for a significant portion of the genome size variations present in the <it>Oryza </it>genus.</p

    Exceptional lability of a genomic complex in rice and its close relatives revealed by interspecific and intraspecific comparison and population analysis

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Extensive DNA rearrangement of genic colinearity, as revealed by comparison of orthologous genomic regions, has been shown to be a general concept describing evolutionary dynamics of plant genomes. However, the nature, timing, lineages and adaptation of local genomic rearrangement in closely related species (<it>e.g</it>., within a genus) and haplotype variation of genomic rearrangement within populations have not been well documented.</p> <p>Results</p> <p>We previously identified a hotspot for genic rearrangement and transposon accumulation in the <it>Orp </it>region of Asian rice (<it>Oryza sativa</it>, AA) by comparison with its orthologous region in sorghum. Here, we report the comparative analysis of this region with its orthologous regions in the wild progenitor species (<it>O. nivara</it>, AA) of Asian rice and African rice (<it>O. glaberrima</it>) using the BB genome <it>Oryza </it>species (<it>O. punctata</it>) as an outgroup, and investigation of transposon insertion sites and a segmental inversion event in the AA genomes at the population level. We found that <it>Orp </it>region was primarily and recently expanded in the Asian rice species <it>O. sativa </it>and <it>O. nivara</it>. LTR-retrotransposons shared by the three AA-genomic regions have been fixed in all the 94 varieties that represent different populations of the AA-genome species/subspecies, indicating their adaptive role in genome differentiation. However, LTR-retrotransposons unique to either <it>O. nivara </it>or <it>O. sativa </it>regions exhibited dramatic haplotype variation regarding their presence or absence between or within populations/subpopulations.</p> <p>Conclusions</p> <p>The LTR-retrotransposon insertion hotspot in the <it>Orp </it>region was formed recently, independently and concurrently in different AA-genome species, and that the genic rearrangements detected in different species appear to be differentially triggered by transposable elements. This region is located near the end of the short arm of chromosome 8 and contains a high proportion of LTR-retrotransposons similar to observed in the centromeric region of this same chromosome, and thus may represent a genomic region that has recently switched from euchromatic to heterochromatic states. The haplotype variation of LTR-retrotransposon insertions within this region reveals substantial admixture among various subpopulations as established by molecular markers at the whole genome level, and can be used to develop retrotransposon junction markers for simple and rapid classification of <it>O. sativa </it>germplasm.</p

    DNA methylation changes facilitated evolution of genes derived from Mutator-like transposable elements

    Get PDF
    Supplementary file S2. Accession numbers and URLs for genome assembly, transcriptome and methylome data that used in this project. (DOCX 101 kb

    The Oryza BAC resource: A genus-wide and genome scale tool for exploring rice genome evolution and leveraging useful genetic diversity from wild relatives

    Get PDF
    Rice was the first crop to have a high-quality reference genome sequence and is now at the forefront of intense functional and evolutionary research for two reasons-its central role in world food security, and its status as a model system for grasses. A thorough characterization of the rice genome cannot be accomplished without a deep understanding of its evolutionary history. The genus Oryza contains two cultivated and 22 wild rice species that represent 10 distinct genome types embedded within a robust phylogeny spanning a ~15 million year time span. The genus contains an untapped reservoir of agriculturally important traits and a historical record of genomic changes (especially those related to domestication, polyploidy, speciation and adaption).The two main objectives of the 'Oryza Map Alignment Project' (OMAP) were to functionally characterize the rice genome from a comparative standpoint and to provide essential tools to leverage the novel genetic diversity from wild relatives for rice improvement. The objective of this review is to summarize our efforts towards developing the most comprehensive genus-wide set of publicly available BAC resources for the genus Oryza, the first of its kind among plants (and perhaps higher eukaryotes), and their applications

    An Integrated Physical, Genetic and Cytogenetic Map of Brachypodium distachyon, a Model System for Grass Research

    Get PDF
    The pooid subfamily of grasses includes some of the most important crop, forage and turf species, such as wheat, barley and Lolium. Developing genomic resources, such as whole-genome physical maps, for analysing the large and complex genomes of these crops and for facilitating biological research in grasses is an important goal in plant biology. We describe a bacterial artificial chromosome (BAC)-based physical map of the wild pooid grass Brachypodium distachyon and integrate this with whole genome shotgun sequence (WGS) assemblies using BAC end sequences (BES). The resulting physical map contains 26 contigs spanning the 272 Mb genome. BES from the physical map were also used to integrate a genetic map. This provides an independent vaildation and confirmation of the published WGS assembly. Mapped BACs were used in Fluorescence In Situ Hybridisation (FISH) experiments to align the integrated physical map and sequence assemblies to chromosomes with high resolution. The physical, genetic and cytogenetic maps, integrated with whole genome shotgun sequence assemblies, enhance the accuracy and durability of this important genome sequence and will directly facilitate gene isolation

    Curated genome annotation of Oryza sativa ssp. japonica and comparative genome analysis with Arabidopsis thaliana

    Get PDF
    We present here the annotation of the complete genome of rice Oryza sativa L. ssp. japonica cultivar Nipponbare. All functional annotations for proteins and non-protein-coding RNA (npRNA) candidates were manually curated. Functions were identified or inferred in 19,969 (70%) of the proteins, and 131 possible npRNAs (including 58 antisense transcripts) were found. Almost 5000 annotated protein-coding genes were found to be disrupted in insertional mutant lines, which will accelerate future experimental validation of the annotations. The rice loci were determined by using cDNA sequences obtained from rice and other representative cereals. Our conservative estimate based on these loci and an extrapolation suggested that the gene number of rice is ~32,000, which is smaller than previous estimates. We conducted comparative analyses between rice and Arabidopsis thaliana and found that both genomes possessed several lineage-specific genes, which might account for the observed differences between these species, while they had similar sets of predicted functional domains among the protein sequences. A system to control translational efficiency seems to be conserved across large evolutionary distances. Moreover, the evolutionary process of protein-coding genes was examined. Our results suggest that natural selection may have played a role for duplicated genes in both species, so that duplication was suppressed or favored in a manner that depended on the function of a gene

    A draft physical map of a D-genome cotton species (Gossypium raimondii)

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Genetically anchored physical maps of large eukaryotic genomes have proven useful both for their intrinsic merit and as an adjunct to genome sequencing. Cultivated tetraploid cottons, <it>Gossypium hirsutum </it>and <it>G. barbadense</it>, share a common ancestor formed by a merger of the A and D genomes about 1-2 million years ago. Toward the long-term goal of characterizing the spectrum of diversity among cotton genomes, the worldwide cotton community has prioritized the D genome progenitor <it>Gossypium raimondii </it>for complete sequencing.</p> <p>Results</p> <p>A whole genome physical map of <it>G. raimondii</it>, the putative D genome ancestral species of tetraploid cottons was assembled, integrating genetically-anchored overgo hybridization probes, agarose based fingerprints and 'high information content fingerprinting' (HICF). A total of 13,662 BAC-end sequences and 2,828 DNA probes were used in genetically anchoring 1585 contigs to a cotton consensus genetic map, and 370 and 438 contigs, respectively to <it>Arabidopsis thaliana </it>(AT) and <it>Vitis vinifera </it>(VV) whole genome sequences.</p> <p>Conclusion</p> <p>Several lines of evidence suggest that the <it>G. raimondii </it>genome is comprised of two qualitatively different components. Much of the gene rich component is aligned to the <it>Arabidopsis </it>and <it>Vitis vinifera </it>genomes and shows promise for utilizing translational genomic approaches in understanding this important genome and its resident genes. The integrated genetic-physical map is of value both in assembling and validating a planned reference sequence.</p

    A physical map for the Amborella trichopoda genome sheds light on the evolution of angiosperm genome structure

    Get PDF
    Background: Recent phylogenetic analyses have identified Amborella trichopoda, an understory tree species endemic to the forests of New Caledonia, as sister to a clade including all other known flowering plant species. The Amborella genome is a unique reference for understanding the evolution of angiosperm genomes because it can serve as an outgroup to root comparative analyses. A physical map, BAC end sequences and sample shotgun sequences provide a first view of the 870 Mbp Amborella genome.Results: Analysis of Amborella BAC ends sequenced from each contig suggests that the density of long terminal repeat retrotransposons is negatively correlated with that of protein coding genes. Syntenic, presumably ancestral, gene blocks were identified in comparisons of the Amborella BAC contigs and the sequenced Arabidopsis thaliana, Populus trichocarpa, Vitis vinifera and Oryza sativa genomes. Parsimony mapping of the loss of synteny corroborates previous analyses suggesting that the rate of structural change has been more rapid on lineages leading to Arabidopsis and Oryza compared with lineages leading to Populus and Vitis. The gamma paleohexiploidy event identified in the Arabidopsis, Populus and Vitis genomes is shown to have occurred after the divergence of all other known angiosperms from the lineage leading to Amborella.Conclusions: When placed in the context of a physical map, BAC end sequences representing just 5.4% of the Amborella genome have facilitated reconstruction of gene blocks that existed in the last common ancestor of all flowering plants. The Amborella genome is an invaluable reference for inferences concerning the ancestral angiosperm and subsequent genome evolution

    Uncovering the novel characteristics of Asian honey bee, Apis cerana, by whole genome sequencing

    Get PDF
    This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.Abstract Background The honey bee is an important model system for increasing understanding of molecular and neural mechanisms underlying social behaviors relevant to the agricultural industry and basic science. The western honey bee, Apis mellifera, has served as a model species, and its genome sequence has been published. In contrast, the genome of the Asian honey bee, Apis cerana, has not yet been sequenced. A. cerana has been raised in Asian countries for thousands of years and has brought considerable economic benefits to the apicultural industry. A cerana has divergent biological traits compared to A. mellifera and it has played a key role in maintaining biodiversity in eastern and southern Asia. Here we report the first whole genome sequence of A. cerana. Results Using de novo assembly methods, we produced a 238 Mbp draft of the A. cerana genome and generated 10,651 genes. A.cerana-specific genes were analyzed to better understand the novel characteristics of this honey bee species. Seventy-two percent of the A. cerana-specific genes had more than one GO term, and 1,696 enzymes were categorized into 125 pathways. Genes involved in chemoreception and immunity were carefully identified and compared to those from other sequenced insect models. These included 10 gustatory receptors, 119 odorant receptors, 10 ionotropic receptors, and 160 immune-related genes. Conclusions This first report of the whole genome sequence of A. cerana provides resources for comparative sociogenomics, especially in the field of social insect communication. These important tools will contribute to a better understanding of the complex behaviors and natural biology of the Asian honey bee and to anticipate its future evolutionary trajectory
    • โ€ฆ
    corecore